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Method and Apparatus for Motion Vector Field Encoding 

The present invention relates to a method and apparatus for encoding 
motion vector fields for sequences of digitised images. 

It is well known to use the principle of motion compensation for 
compression of digitised motion pictures. In the majority of the known 
approaches, including MPEG-1, MPEG-2 and H263, motion compensation is 
performed using square or rectangular blocks of pixels, and a motion vector is 
assigned to each block. For example, it is known to compare a block of 
pixels, for example, a 16x16 block, with blocks in a preceding reference 
image, usually within a limited region of the reference image. The block in the 
reference image which is most similar to the current block under consideration 
is found, using an algorithm such as minimum mean square error, and a 
/ corresponding motion wector is associated with the current block. Thus, 
16x16 blocks in a current image are associated with a respective motion 
^vector, as represented in Fig. 1. The motion vectors are subsequently 
quantized to either full-^el or sub-pel accuracy (usually half-pel or quarter- 
pel) and the result is usually encoded differentially. 

Allocating a single motion sector to a block of pixels in an image in 
the kjiown manner summarised 'above is useful for representing translation 
motion. However, there are a number of disadvantages in a block-based 

" f 

motion compensation approach of the type discussed above that can 



significantly limit the performance in prediction algorithms. For example, 
such an approach does not perform well for regions including motion, such as 
a rotation or a change in scale, which is more complex than simple 
translation^ movement. Also, block-based prediction is poor for non-rigid 
motion, such as that exhibited by clouds or humans, for example. Further, the 
block-based approach imposes motion boundaries along the blocks and may 
not accurately reflect motion boundaries within the blocks. Furthermore, 

reconstructed images may exhibit "blocky" artefacts, especially in situations 

■si 

where a motion boundary runs across a block. 

In order to alleviate such problems, some schemes have employed 
variable block size motion compensation with parametric motion models 
(including quasi-affine and affine motion models in addition to a translation 
-one). In those schemes, the size of the block used for motion compensation 
and the motion model depends oh how many moving objects are present 
within a region and on the complexity of the motion field. Such an approach 
-offers some improvements over the approach using blocks of a fixed size but 
the improvement is limited. 

The present invention provides a method and apparatus for 
representing motion in a sequence of digitized images by deriving a dense 
motign Vector field for an image and performing vector quantization on the 
motion vector field. 



In the context of this specification, the term dense in relation to a 
motion vector field means that for a pixel block of size 8x8 pixels there are at 
least two motion vectors. For example, a motion vector may be assigned to 
each 4x4 pixel block or each 2x2 pixel block. Preferably, a motion vector is 
assigned to each pixel. 

As a result of the invention, an efficient representation of the motion 
field can be obtained for coding. A reconstructed image with improved visual 
quality can be obtained, as a result of better prediction using motion 
compensation and because the number of artefacts can be reduced. In 
particular, blocking artefacts can be reduced or eliminated entirely. Also, the 
approach lends itself to a scaleable representation of the motion field, with 
increased robustness to transmission errors, discussed in more detail below. 

The invention also provides a method and apparatus for representing 
motion in a sequence of digitized images by generating and coding a plurality 
of versions of a motion vector field at different resolutions. , 

As a result of the invention, a scaleable representation of the motion 
field and an embedded *bit stream can be created. Thus, a coarse version of 
the motion vector field can easily be reconstructed by decoding a part of the 
bit stream. This has several advantages. Fpr example, if part of the bitstream 
is corrupted in transmission, a low resolution version of the motion field can 
still be recovered and used for' motion compensation. Alternatively, some 
applications may only require a coarse version of the field, and thus 



processing power and memory requirements can be saved by reconstructing 
the coarse image directly from a truncated bitstream rather than reconstructing 
a full version then sub-sampling it to obtain a coarse version. Such an 
approach may be useful, for example, in video mobile telephones. 

The invention also provides a method and apparatus for pre-processing 
a motion vector field to reduce the entropy without significantly reducing the 
prediction error. This can be done by averaging of the adjoining motion 
vectors, possibly with other constraints which limit the effects that such 
averaging may have on the quality of motion prediction and on the 
preservation of motion discontinuities. 

The invention also provides a method and apparatus for processing 
data relating to an image in a sequence of digitized images comprising which 
identifies where motion discontinuities occur in the image, and smoothes the 
motion vector field by. combining adjacent motion vectors taking account of 
where motion discontinuities occur in the image. i 

As a result, a more accurate reflection of the motion vector fields can 
be obtained. 4 

These and other aspects of the invention are set out in the appended 

claims. 

_ embodiments of the invention will be described with reference to the 
accompanying drawings of which: 



Fig. 1 is a representation of a motion vector field according to the prior 

art; 

Fig. 2 is a block diagram of an encoder according to an embodiment of 
the invention; 

Fig. 3 is a representation of an image region showing discontinuity 

labels; 

Fig. 4 is a block diagram of a component of the encoder shown in Fig. 

2; 

-a 

Fig. 5 is a representation of a bit stream output from the encoder 
shown in Fig. 2; 

Fig. 6 is a block diagram of a decoder according to an embodiment of 
the invention; and 

Fig. 7 is a block diagram of a hybrid DCT/MC codec according to an 
embodiment of the invention. 

Fig. 2 is a block diagram of a multi-resolution motion vector field 

— (MMVF) encoder according to an embodiment of the invention. The MMVF 

encoder as shown in Fi|. 2 includes a motion estimator (ME) module 210 for 

generating a motion vector field and a motion discontinuity map for an image 

in a sequence of images input to the module. The output of the motion 

estimator module is connected to the input of .an entropy reduction module 

220, which is for processing 'the motion vector field using the motion 

■ t 

discontinuity map to produce , an entropy constrained motion vector field with 



discontinuities. The output of the entropy reduction module 220 is connected 
to the input of a multi-resolution vector quantization motion field and 
discontinuities coding module 230, which is for producing a multi-resolution 
vector quantized (MMVF) representation of the motion vector field. 

The operation of the MMVF encoder will be described in more detail 

below. 

The motion estimator module 210 operates on a sequence of input 
image frames to produce dense motion vector fields, each with an associated 
motion discontinuity map. 

A motion vector field for an image frame is derived using motion 
estimation based on the image frame and a reference frame. In this 
embodiment, the estimation is performed using reference frame which has 
been reconstructed from the coded version of the original of the reference 

i frame and the consecutive, original, frame. As an alternative, for example, 

1 

motion estimation could be performed using an original reference frame and 
■-the original consecutive frame. 

The motion estimation is carried out using a variant of a known block- 
matching technique. A block of 16x16 pixels in the image frame is compared 
with blocks in the reference frame. -When the closest match has been found, a 
corresponding motion vector is assigned to a reference pixel, which in this 
embodiment is the top left-hand pixel of the four centre pixels. In other 
embodiments, the. reference pixel can be any pixel within an m x n block, but 



it is preferably close to or at the centre of the block. These steps are repeated 
for overlapping 16x16 blocks in the image frame in order to obtain a motion 
vector for each pixel in the image, and consequently a motion vector field for 
the whole image. Although the motion vector field produced in this manner 
may still exhibit a block-like structure, an accurate dense motion-vector field 
is recovered in the entropy reduction module 220. Other methods for 
obtaining a motion vector for a pixel, such as pel-recursive techniques or 
gradient-based methods can be used, as described in "Digital Pictures - 
Representation, Compression and Standards" by A. Netravali and B.G. 
Haskell, Plenum Publishing 1995. 

The motion estimator module 210 also generates a motion 
discontinuity map reflecting motion discontinuities in the image frame. In 
general, motion discontinuities are located in between pixels, and so each 
i pixel has four discontinuity labels 611, 612, 613, 614 associated with it, one 
for each of the top, bottom, right and left sides, as shown in Fig. 3. Each 

discontinuity label is shared between two adjacent pixels. For example, the 

■ - ... * . . ; • 

right discontinuity labef 612 for pixel 600 is the left discontinuity label 612 
for the pixel 602 neighbouring pixel 600 on the right. 

Motion discontinuities in -the image are identified using a suitable 
techpiqae, such as the technique described in the paper "Robust Motion 
Analysis" by M. Bqber and J/Kittler, CVPR, 1994, pp 947-952. Briefly, 

■ i 

statistical analysis is used to. estimate the spread of residual errors in a block 



with respect to the closest matching block in the previous frame. Pixels 
having a residual error outside an acceptable range are treated as belonging to 
a different motion region from those in the rest of the block. The boundaries 
between different motion regions are the motion discontinuities. 

The motion vector field and the motion discontinuity map derived by 
the motion estimator module 210 are input to the entropy reduction module 
220. The entropy reduction module 220 processes the motion vector field 
estimated in the motion estimator module 210, taking into account the motipn 
discontinuity labels to preserve the motion boundaries, to produce an entropy- 
constrained motion vector field. The process is based on a pel-recursive 
technique, and is discussed in more detail below. 

The processing for a single pixel in the image will be described for a 
single pixel with reference to Fig. 3. The central pixel 600 and its four 
neighbours: top, left, right and bottom 601, 602, 603, 604 are considered. In 
addition, the four motion discontinuity labels 611, 612, 613, 614 for the 
-central pixel 600 are considered. Each pixel has a motion vector associated 
with it, say pixel 600 has a motion vector V600, and so on. After processing, 
a new value of the motion vector V600 is calculated for the central pixel. 

The processing in this embodiment is based on taking a weighted 
averajge'of the motion vectors of the centre pixel 600 and the motion vectors 
of those neighbouring pixels, 60l, 602, 603, 604 which are not separated from 
the centre pixel by an active (iiscontinuity label. More specifically, 



V600x(i + 1) = [(k * V600x(i)) + V601x(i) + 

V602x(i) + V603x(i) + V604x(i)]/(k+4) 
and V600y (i + 1 ) = [(k * V600y (i)) + V60 1 y(i) + 

V602y(i) + V603y(i) + V604y(i)]/(k+4) 
where V60nx(i) and V60ny(i) are the motion vector components 

assigned to pixels 60n in the ith iteration, and k is a constant greater than or 
equal to zero. 

If one or more of the neighbouring pixels is separated from the centre 
pixel by an active motion boundary, it is omitted from the calculation, with 
adjustment of the denominator accordingly. For example, supposing the 
motion discontinuity label 61 1 is active, then V600x(i + 1) is calculated as: 
V600x(i + 1) = [(k*V600x(i)) + V602x(i) + 

V603x(i) + V604x(i)]/(k+3) 
In an alternative embodiment, processing is again based on an average, 
but the prediction error is also taken into account. 
^ More specifically?, % - 

V600x(i + 1) = [(V601x(i) + V602x(i) + V603x(i) 
+ V604x(i))/4] - PE/(m+V 2 I)* Vxl) 

V600y(i + 1 ) = [(V60 1 y(i) + V602y(i) + V603y(i) 
+ V604y(i))/4] - PE/((m+V 2 I)* VyI) 
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Here, PE is the prediction error, that is the difference in luminance 
values between the pixel 600 and the pixel in the reference frame after 
displacement using the motion vector calculated in iteration i, that is V600 (i), 
where V600 (i) = (V600x (i), V600y (i)). Vxl and VyI are the components of 
the image intensity gradient for pixel 600. The image gradient in this 
embodiment is calculated based on the luminance values of the centre pixel 
and a neighbouring pixel. In particular, Vxl = I 602 - I 600 and VyI = I 60l - I 600 , 
where I 60n represents the luminance value of pixel 60n. The image gradient 
can be calculated in other ways using neighbouring pixels. For example, Vxl 

and VyI can be calculated as Vxl = Ie ° 2 ~ /fi04 and VyI = Itox ~ I(>Q3 

2 2 

2 2 2 

V I = (Vxl) + (VyI) , and m is a constant greater than 0. In this alternative 
embodiment, m = 100. Again, if one of the neighbouring pixels is separated 
, by an active motion boundary, it is omitted from the calculation. Thus, 
supposing motion discontinuity' label 61 1 is active, then \ 
; V600x(i + 1) = [(V602x(i) + V603x(i) + V604x(i))/3] 

^ " '] PE /(Cm+V 2 I)* Vxl) 

In each of the alternative processing methods described above, the 
processing is performed for all the pixels in the image, and a number of 
iterations, or passes, are perfopied. The order in which the pixels are 
processed in a single pass does, not significantly influence results. In these 
embodiments, 5-10 iterations are performed, although the optimal number of 
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iterations in other embodiments will depend on the -type of motion estimator 
used. 

As a result of the processing described above, a smoothed, entropy 
constrained, version of the motion vector field is obtained, with the motion 
discontinuities being preserved. By taking account of the motion 
discontinuity labels, and the prediction error in the second processing method 
described above, the smoothing is only performed in the areas where it does 
not reduce the efficiency of the motion compensation prediction. An example 
representation of a region of a motion vector field with motion discontinuities 
as output by the entropy reduction module 200 is shown, indicated as 225, in 
Fig. 2, where the circles represent pixels, the arrows represent motion vectors 
and the line between pixels represents a motion discontinuity. 

The entropy constrained motion vector field with motion 

discontinuities produced by the entropy reduction module 220 is input to the 

multi-resolution motion vector field quantization (MMVFQ) and 

discontinuities coding module 230. 

... * ; ■ 

& f 
Fig. 4 shows the MMVFQ and discontinuities coding module in more 

detail. 

Referring to Fig. 4, the motion vector field obtained from the entropy- 
reduGlicfti module 220 is input info a motion field pyramid module 310, which 
produces a set of n+ J motion fields which are versions of the original motion 
vector field at descending spatial resolutions. The n+1 motion fields have 
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image resolutions sO to sn, where the motion field at the original image 
resolution has resolution sn and the coarsest resolution motion field has a 
resolution sO. A motion field of resolution sm is obtained by low-pass 
filtering and sub-sampling of the higher resolution motion of resolution 
s(m+l). This process is carried out n times starting from the original motion 
field to produce the n+1 fields, called a pyramid of motion vector fields. 

According to this embodiment, the sub-sampling factor k is 2, and the 
low pass filtering process averages the vx and vy values within a 2x2 block. 
The average values vx_aver and vy_aver are then taken to represent the 
motion of the block at the coarser resolution. However, various sub-sampling 
factors k (k>l) and various low-pass filters can be used. 

The pyramid of motion fields is then processed. The motion field at 
resolution sO is encoded by a vector quantization (VQ) encoder 330a using a 
codebook cO. The output from the VQ encoder 330a goes to a module 380a 
for entropy encoding to form a representation of the motion field at the 
-ebarsest resolution sO, and to a VQ decoder 360a. The VQ decoder 360a uses 
the codebook cO to reconstruct the coarse motion field^ which is then passed 
to an up-sampling module 340a where the resolution of the motion field is 
increased by a factor of k. A- differential module 350a calculates the 
difference between the motion' field at resolution si and the up-sampled 
reconstructed motion field obtained from the field of resolution sO. .The 
residual error motion field fat resolution si so obtained is output from the 
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differential module 350a to be processed by the VQ encoder 330b using a 
codebook cl. The steps described above are repeated recursively at increasing 
resolutions until the motion fields at all resolutions up to the original 
resolution have been processed, and n+1 component representations of the 
original motion vector field are obtained. 

The vector quantization mentioned above is carried out on each of the 
motion vector fields in the encoding modules 330a - 330n. The vector 
quantization is analogous to that described in "Scalable image coding using 
Gaussian pyramid vector quantization with resolution-independent block size" 
by L. Cieplinski and M. Bober, Proceedings IEEE International conference on 
Acoustics, Speech and Signal Processing, 1997, vol. 4, pp 2949-2952, where 
vector quantization is described in relation to the luminance values of a still 
picture. The contents of that paper are incorporated herein by reference. 

The vectors to which vector quantization is applied at each resolution 

^ i 

can be formed in one of the following ways: i 

1) By grouping o (o > 1) component velocities from non- 
overlapping regions in the velocity component fields Vx and Vy 
independently. The regions should have identical shape and should cover 
altogether the entire velocity field. For each region, two o-dimensional 
vectQFS dre formed VQx = (vxl, "... vxi, ... vxo), ; VQy = (Vyl, ... vyi, ... vyo), 
where vxi, vyi are the x and y velocity components of the pixel i within the 
block. * 
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2) By performing a transformation on the vector field V, before 
forming the vectors as above. The purpose of the transformation is to make 
the component fields statistically independent, or to reduce their correlation in 
order to improve coding performance. For example, each vector v = (vx, vy) 
at each pixel location within the image may be transformed into log-polar 
representation vp = (vpr, vpa), where components vpa, vpr are defined as: 

vpr = square root (vx*vx + vy*vy) 

vpa = arc tangent (vy/vx), if vx * 0. 
n/2, ifvx = 0and vy >0 
-IT/2, if vx = 0 and vy < 0. 

For each region, two o-dimensional vectors will be formed VQr = 
(vprl, vprj, vpro), VQa = (vpal, vpaj, vpao), where vprj, vpaj are 
obtained from the transformation outlined above. 

3) The component vectors VQx, VQy, as defined in 2), can be 
grouped together to form the vector VQ = (VQx, VQy) or VQ = (VQr, VQa), 
ai}d VQ can be quantize*! Naturallyj, the VQ vector formed in that way has 
the dimension 2o. 

In this embodiment method 2) above is used with rectangular blocks 

of 2x2 pixels, so that VQ is performed on 4 dimensional vectors. 
/ { 

The component representations 370 ... 37n of the motion vector field 
output from the entropy coders 380a - 380n are combined into an embedded 
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representation 260, as shown, for example, in Fig., 5, which shows the 

component representations at the different resolutions separated by separators. 

A decoder 500 for decoding the embedded representation of the 

motion field to reconstruct the motion fields at different resolutions and the 

original motion vector field is shown in Fig. 6. The reconstruction starts from 

the coarsest resolution representation. It is decoded by an entropy decoder 

510a and passed to a VQ decoder 515a, where it is decoded using the 

codebook cO to obtain the reconstructed motion field at resolution sO. The 

reconstructed motion field at resolution sO is then up-sampled by up-sampling 

module 525a, in the same way as in the encoder, to obtain an approximation 

of the field at resolution si. The residual error for the motion field at 

resolution si is then reconstructed in a similar way. The residual error for 

resolution si and the field up-sampled from the field at resolution sO are 

added together in a summation module to create a reconstructed motion field 

at resolution si. The process is repeated at each finer resolution until the 

original resolution motion field is obtained. However, the process can be 
\ • • ' * • , ' ' 

stopped at any resolution before the original resolution if desired. 

Fig. 7 shows an hybrid DCT/MC codec for coding and decoding video 
image data according to an embodiment of the present invention. 

The coder side comprises a DCT module 710 for performing discrete 
cosine transforms oji the input image data. The DCT module 710 is 
connected to a adaptable quantizer 720 for performing adaptive quantization 
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on the DCT coefficients output from the DCT module. The quantizer 720 is 
controlled by a rate controller 730. The output of the quantizer 720 is 
connected to a variable length coder 740 and an inverse quantizer 750. The 
output of the inverse quantizer is connected to an inverse DCT module 760 for 
reconstructing a version of the original frame. The output of the DCT module 
760 is connected to a plurality of frame stores 770. The outputs of the frame 
stores 770 are connected to a multi-resolution motion vector field (MMVF) 
coder 780, for deriving and encoding a motion vector field. The MMYF 

■"k 

encoder 780 is connected to an advanced motion compensation module 790, 

for performing motion compensation in a known manner. The MMVF coder 

780 also outputs data representing a coded motion vector field to the variable 

length coder 740, and the output of the variable length coder 740 is connected 

-to a buffer 800. The buffer 800 is used to adjust the rate controller 730, and 

4 the stored data is output for transmission or recording on a recording medium. 

~~ i 

Corresponding components are provided in the decoder, including a 
HDuffer 810, a variable length decoder 820, an inverse quantizer 830, an inverse 
DCT module 840, an advanced motion compensation module 850, frame 
stores 860, an MMVF decoder 870. 

The coder and decoder operate essentially in a known manner for 
codiog the video image data, dpart from the coding and decoding of the 
motion vector field .using the KlMVF coder 780 and decoder 870. /The 
MMVF coder 78 T 0 and decoder 870 operate essentially as described above. 
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However, here the variable length coder 740 and decoder 820 perform the 
entropy coding and decoding in place of the entropy coders 380a - 380n and 
decoders 515a - 515n as described above. It is possible to use a single 
variable length coder 940 in place of the n entropy coders 380a - 380n, 
employing different look-up tables for the image data at different resolutions, 
and likewise for the decoder 820. 

The efficiency (based on the average number of bits per pixel) of the 
coding as described above can be similar to or better than for known methods. 
Although the allocation of a motion vector to each pixel increases the amount 
of motion information, this is reduced in subsequent processing, that is, in the 
entropy reduction processing and the vector quantization. The average 
number of bits per pixel will, of course, depend on the nature of data being 
coded. 

In the embodiment of the invention described above, a motion 
discontinuity map is derived and used in subsequent processing, but it is riot 
--essential to take account of motion discontinuities. Instead of deriving a 
motion vector for each pixel, as in the described embodiment, the invention is 
also applicable to an approach where a motion vector can be assigned to a 
group of pixels, such as a 2x2 or 4x4 pixel block, for example. An important 
feature however is that the resulting motion vector field is dense. 

The invention is particularly useful for applications dealing with 
sequences of images when one or more of the following conditions apply: 
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i) channel bandwidth is limited 

ii) the risk of corruption of data is high, or 

iii) the user may benefit from obtaining a low-resolution version of 
the motion data. 

Motion information derived and represented in accordance with the 
invention can be stored in database and used for search and browse purpose. 
For example, a person looking for a sequence of particular type of movement 
can first obtain coarse motion information from that database (storing motion 
vector information obtained in accordance with the present invention), and 
then retrieve a selected sequence at original resolution. 



.-7 



Claims 
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1 . A method of representing motion in a sequence of digitized 
images comprising deriving a dense motion vector field for an image and 
performing vector quantization on the motion vector field. 

2. A method as claimed in claim 1 wherein a motion vector is 
derived for pixel blocks sized less than an 8x8 pixel block. 

3. A method as claimed in claim 1 wherein a motion vector is 
derived for each pixel. 

4. A method as claimed in any one of claims 1 to 3 wherein 
vector quantization is performed on the components of the motion vectors 
separately. i 

\ ... si . . / ' 

5. A method as claimed in any one of claims 1 to 4 comprising 
performing variable length coding after vector quantization. 

„ 6. A method as claiihed in any one -of claims 1 to 5 comprising 
identifying where mo.tion discontinuities occur in the image. 
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7. A method as claimed in any one of claims 1 to 6 comprising 
processing the motion vector field to reduce the entropy of the vector field 
before vector quantization. 

8. A method as claimed in claim 7 dependent on claim 6 wherein 
the motion discontinuities are used in the entropy-reduction processing. 

9. A method as claimed in claim 7 or claim 8 wherein entropy- 

/* 

reduction is performed by averaging neighbouring motion vectors. 

10. A method as claimed in claim 9 dependent on claim 8 wherein 
a motion vector separated by a motion discontinuity is not used in the 
averaging. 

i -v 

11. A method as claimed in any one of claims 1 to 10 comprising 
-generating and encoding a plurality of versions of a motion vector field at 

different resolutions. 4 

12. A method as claimed in claim 1 1 comprising sub-sampling the 
motion Vector field to produce sub-sampled versions of the field at a first, 
coarse, resolution and at a second, finer, resolution, coding the coarse 
resolution motion vector field* . comparing the coarse resolution field with the 
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finer resolution field to produce a residual error, and coding the residual error 
at the finer resolution. 

13. A method of processing data relating to an image in a sequence 

of digitized images comprising deriving a motion vector field for the image 

and smoothing the motion vector field by replacing a given motion vector by 

a new motion vector derived using averaging based on adjoining motion 

vectors, the method further comprising identifying where motion 

i 

discontinuities occur in the image and omitting a motion vector or vectors 
from the averaging if they are separated from the given motion vector by a 
motion discontinuity. 

14. A method as claimed in claim 13 wherein the new motion 
vector is derived using a prediction error. 

-~ f ' 15. A method of processing data relating to an image in a sequence 
of digitized images comprising deriving a motion vector field for the image 
and smoothing the motion vector field by replacing a given motion vector by 
a new motion vector derived using averaging based on adjoining motion 
vectors and on a prediction error/ 
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16. A method as claimed in claim 15 comprising identifying where 
motion discontinuities occur in the image and omitting a motion vector or 
vectors from the averaging if they are separated from the given motion vectors 
by a motion discontinuity. 

17. A method as claimed in any one of claims 13 to 16 wherein a 
motion vector is derived for each pixel. 

18. A method as claimed in any one of claims 1 3 to 17 wherein the 
averaging takes account of the given motion vector. 

19. A method as claimed in claim 1 8 using a weighted average. 

20. A method of processing data relating to an image in a sequence 
of digitized images comprising deriving a motion vector field for the image, 

-identifying where motion discontinuities occur in the image, and smoothing 
the motion vector field $>y combining adjacent motion vectors taking account 
of where motion discontinuities occur in the image. 

■ ^ 21. A method as claifned in claim 20 wherein a motion vector is 
derived for each pixel . 
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22. A method of processing an image comprising deriving motion 
discontinuities representing motion boundaries in a motion vector field and 
adjusting motion vectors within each boundary on the basis of other motion 
vectors within that boundary. 

23. A method of representing motion in a sequence of digitized 
images comprising generating and coding a plurality of versions of a motion 
vector field at different resolutions. 

■'k 

24. A method as claimed in claim 23 comprising sub-sampling the 
motion vector field to produce sub-sampled versions of the field at a first, 
coarse, resolution and at a second, finer, resolution, coding the coarse 
resolution version of the motion vector field, comparing the coarse resolution 
field with the finer resolution field to produce a residual error, and coding the 

J: 

residual error at the finer resolution. i 

it ' 

25. A method as claimed in claim 24 comprising producing and 
coding a sequence of residual errors at increasing resolutions. 

vr 26. A method as clairfied in any one of claims 23 to 25 comprising 

multiplexing the coded motion vector field versions at different resolutions 

• t 

into an embedded bit stream^ 
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27. A method as claimed in any one of claims 23 to 25 wherein 
said coding is performed using vector quantization. 

28. An encoder for encoding motion information for a sequence of 
digitized images according to a method as claimed in any one of claims 1 to 
27. 

29. A decoder for decoding motion information for a sequence of 
digitized images encoded according to a method as claimed in any one of 
claims 1 to 27. 

30. An encoder for encoding motion information for a sequence of 
4 digitized images comprising a motion estimator for deriving a dense motion 

vector field for an image and a vector quantizer for coding the motion vector 
—field. 

31. Apparatus for representing motion in a sequence of digitized 
images comprising means for generating and coding a plurality of versions of 
a motioft vector field at different Resolutions. 
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32. Apparatus as claimed in claim 31 comprising means for sub- 
sampling the motion vector field to produce sub-sampled versions of the field 
at a first, coarse, resolution and at a second, finer, resolution, means for 
comparing the coarse resolution field with the finer resolution field to produce 
a residual error, and means for coding the coarse resolution motion vector 
field and the residual error at the finer resolution. 

33. Apparatus as claimed in claim 32 comprising means for 
producing and coding a sequence of residual errors at increasing resolutions, 

34. A decoder for decoding information encoded using an 
apparatus as claimed in any of claims 3 1 to 33, the decoder comprising means 
for reinstating a version of the motion vector field at a desired resolution. 

35. A hybrid DCT-MC codec comprising an encoder according to 
any one of claims 28, 30 or 31' to 33 and a decoder according to claim 29 . or 
claim 34. 

\ ••• « ■ , ' ' 

i 

36. An encoder substantially as hereinbefore described and as 
shown in the accompanying drawings. 

37. A method for encoding motion information substantially as 
hereinbefore described with reference to the accompanying drawings. 
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ABSTRACT 

Method and Apparatus for Motion Vector Field Encoding 



A method and apparatus for representing motion in a sequence of 
digitized images derives a dense motion vector field and vector quantizes the 
motion vector field. 

Fig. 2 
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